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ABSTRACT 

The Schools and Staffing Survey (SASS) represents the 
first time the National Center for Education Statistics has 
integrated three of the Elementary and Secondary Education Surveys: 
the Teacher Demand and Shortage Surveys, Public and Private School 
Surveys, and Teacher Surveys. The SASS was designed to measure the 
critical aspects of teacher supply and deipand^ the composition of 
teacher and administrator work force, and the general status of 
teaching and schooling. The SASS was conducted by the Bureau of the 
Census in the 1987-88 school year. The SASS sample included 12,823 
public and private schools and administrators, 65,124 teachers, and 
5,592 Local Education Agencies (LEAs) . The merger of the three 
surveys produced one database to provide comparable linkage data 
among LEAs, schools, and teachers. Response rates ranged from 66% for 
private school teachers to 94.4% for public school administrators. 
Sample selection is discussed for public and private sectors. The 
discussion of estimation considers: weighting; item response rates 
and imputation; variance estimation; and frame evaluation. Data are 
reported in 18 tables. Three appendices discuss: minimizing school 
overlap with other national and longitudinal surveys; allocating 
sample groups to minimize loss of prediction for specific estimators; 
and the effect of a finite population correction on SASS variance 
estimates. (SLD) 
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Introduction 



General Goals 

The Schools and Staffing Survey (SASS) is the first time the 
National Center of Education Statistics (NCES) has integrated 
three of the Elementary and Secondary Education Surveys: the 
"Teacher Demand and Shortage Surveys," the "Public and Private 
School Surveys," and the "Teacher Surveys." SASS components are: 
"Teacher Demand and Shortage Survey", the "School and School 
Administrator Surveys", and the "Teacher Survey". The survey was 
designed to measure raore accurately the critical aspects of 
teaching supply and demand, the composition of the administrator 
and teacher work force, and the status of teaching and schooling 
generally. The SASS was conducted by the Bureau of the Census 
during the 87-88 school year. Approximately 13,000 schools and 
a' uixnistrators, 65,000 teachers, and 5,600 Local Education 
Agencies comprise the SASS sample. The idea behind the merger is 
to produce one database that would provide comparable linkage 
data between the LEAs (local education agencies) , schools, and 
teachers. In addition, SASS is the first time these three surveys 
have the same reference period. To accomplish this: 

1) Schools were selected first. Each selected school 
received a school and administrator questionnaire. 

2) A sample of teachers was selected, within each selected 
school. Each selected teacher received a teacher 
questionnaire . 

and 3) For public schools, the LEAs associated with the selected 
schools received a Teacher Demand and Shortage (TDS) 
questionnaire. Selected private schools received a TDS 
questionnaire, but were asked to respond only for the 
school . 

An additional requirement of the survey design is that there be 
minimal school overlap between the National Assessment uf 
Educational Progress (NAEP) and the National Education 
Longitudinal Study (NELS) surveys which also went to field in 
'88. The purpose of this requirement is to reduce the collection 
burden placed on the selected schools by NCES. This requirement 
was satisfied by adjusting the selection probabilities given the 
NAEP and NELS samples so that the overlap between the surveys was 
minimal. This was done in a fashion which maintained the original 
selection probabilities. 

The SASS is designed to provide the following analytical 
comparisons for each of the SASS surveys, except where otherwise 
noted: 

1 
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1) Public and private sector comparisoriS, 

2) State public sector comparisons, 

3) Elementary, secondary and combined school comparisons for 
tha public and private sectors, 

4) Detailed private sector affiliation comparisons, 

and 5) Fields of teaching assignment comparisons for public and 
private sector teachers. 

These requirements were satisfied by allocating the sample using 
an optimization algorithm which minimized the relative loss in 
precision with respect to sector, national, state and level 
estimates. 

The organization of this paper is to describe the procedures used 
for the: 1) school and teacher sample strat"Tm allocation, 2) 
minimization of overlap with NA'EP and NELS, 3) public school 
sample design, 4) LEA sample design, 5) private school sample 
design, 6) teacher sample design (including within school teacher 
allocation) , 7) weighting, 8) imputation, 9) variance estimation 
techniques and 10) frame evaluation. 

Response Rates 

Below are the weighted questionnaire response rates for the SASS 
components. These response rates are defined as the weighted 
number of in scope responding questionnaires divided by the 
weighted number of in scope sample cases. The overall response 
rate for a particular item (questionnaire response rate times 
item response rate) may be lower than the respective response 
rate below, because the item nonresponse rates are not included 
in the figures be]ow. 




Table 1. — Questionnaire response rates 

Teacher Demand and Shortage Survey 

Public LEAS 90.8% 
Private schools 66.0% 

School Administrator Survey 

Public 94.4% 
Private 79.3% 



School Survey 

Public 91.9% 
Private 78.6% 



Teacher Survey 

Public 86.4! 
Private 79. 1= 



A future paper will examine survey response rates and possible 
bias in more detail. 
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Scope of SASS 

The following survey terms define the scope of the components of 
the SASS. 

Teacher Demand and Shortage Survey 

1) Private School 

A private school was defined as a school not in the 
public system that provides instruction for any of grades 
1-12 where the instruction was not given exclusively in a 
private home. 

2) Local Education Agency (LEA) 

An LEA, or public school district, was defined as a 
government agency administratively responsible for 
providing public elementary and/or secondary instruction 
and educational support services. The agency or 
administrative unit must operate under a public board of 
education. Districts which operate only one school, as 
well as districts which do not operate schools but do 
hire teachers are included. 

3) Out-of -Scope 

An LEA or private school was considered out-of-scope for 
the Teacher Demand and Shortage Survey if it did not 
employ elementary or secondary teachers. In addition, if 
it was undetermined whether a private school operates in 
a private home and its enrollment was less than 10 
students and it had only one teacher, the school was 
considered out-of-scope. If a private school was 
classified as out-of-scope for the Private School Survey, 
it was automatically classified as out-of-scope for the 
Teacher Demand and Shortage Survey. 

School Survey 

1) Public School 

A public school was defined as an institution which 
provides educational services, has one or more teachers 
to give instruction, is located in one or more buildings, 
receives public funds as primary support, and is operated 
by an education agency. Prison schools, schools operated 
by the Department of Defense and the Bureau of Indian 
Affairs were included. 
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2) 



Private School 



See above for definition. 

3) Out-of-Scope (public schools) 

A public school was considered out-of-scope if it did not 
have any students in any of grades 1-12. Schools 
offering only kindergarten and pre-kindergarten were 
deleted from the sampling frame before the sample wp.s 
selected., If a school was determined to be out-of-scope 
after editing its questionnaire, it was deleted from the 
data tape. 

4) Out-of-Scope (private schools) 

A private school was considered out-of-scope if it did 
not have any students in any of grades 1-12, if it 
operated in a private home that was used as a family 
residence, or if it was undetermined whether it operated 
in a private home and its enrollment was less than 10 
students or it had only one teacher. Out-of-scope schools 
were deleted from the sampling frame before the sample 
was selected. If a school was determined to be out-of- 
scope after editing its questionnaire, it was deleted 
from the data tape. 

School Administrator Survey 

1) Out-of-Scope 

A school administrator sample case was consiaered 
out-of-scope if the school did not have an administrator. 
Also, if a sample administrator's school was considered 
out-of-scope, the administrator was automatically 
classified as out -of-scope. 



Teacher Survey 
1) Teacher 

A teacher is defined as any full-time or part-time 
teacher whose primary assignment was teaching in any of 
grader. K-12 • Itinerant teachers were included, a? v^ell as 
long-term substitutes who were filling the role a 
regular teacher on an indefinite basis. An itinerant 
teacher is defined as a teacher who teaches at more than 
one school. 
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2) 



Out-of-Scope 



A sample teacher was considered out-of-scope if he/she 
was a short-term substitute, a student teacher, ? 
nonteaching specialist (e.g., guidance counselor, 
librarian, etc. ) , an administrator (e.g. , principal, 
assistant principal, etc.), a teacher's aide, or in some 
other professional or support staff position. If a sample 
school was considered out-of-scope, all teachers from 
that school were also considered out-of-scope. 



If an LEA was classified as out-of-scope, its teachers, 
administrators and schools were alr.o classified as out-of-scope. 
Likewise if a school was classified as out-of-scope, its teachers 
and administrators wera also considered out-of-scope. In 
addition, if a private school was out-of-scope for the Private 
School Survey, it was also considered as out-of-scope for the 
Teacher Demand and Shortage Survey. 
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School and Teacher Allocation 
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School and Teacher Allocation 



In order to allocate the SASS school and teacher samples, it was 
necessary to determine the analytical goals for both school and 
teacher estimates. The major analytical schoo" and teacher goals 
in terms of desired estimates are listed below: 

1) Overall national estimates 

2) State comparisons for public sector 

3) School level (elementary, secondary, and combined) 
comparisons for both public and private sectors 

and 4) Public vs. private comparisons 

From these design goals, it was decided to stratify the school 
selection process by sector/state/level. A description of how the 
schools and teachers were allocated to these strata will be 
provided first. Then, additional design goals will be stated. 
Finally, how the sector/state/level allocation was modified to 
satisfy the additional goals will be described. 

Generally speaking, the following can be said: The optimum 
allocation for each of the above comparisons is quite different. 
For national estimates, schools and teachers should be allocated 
proportional to their numbers within each stratum. State 
comparisons are best when each state gets an equal sample 
allocation. Using the best state allocation would provide less 
efficient national estimates (e.g., California would get the same 
sample size as Delaware, even though California contributes a lot 
more to the national variance than Delaware). Likewise, the 
optimum allocation for level and sector are both different and 
what's good for one comparison is less efficient for the other. 
Therefore, a compromise allocation was determined. The compromise 
allocation weighted each of the goals stated above equally. 

The idea behind the compromise allocation was to look at the 
relative loss of efficipnry (i.e., the ratio of the variance for 
the proposed allocation to the variance for the optimum 
allocation) , for each of the four individual optimum allocations 
stated above (i.e., national, state, level and sector). The 
optimum allocations assumed a linear cost function with fixed 
total costs. These relative efficiencies were computed for both 
school and teacher estimates. The compromise SASS school and 
teacher allocation was the allocation that minimized the sum of 
these eight relative efficiencies (four school and four teacher). 
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An additional design goal was: 



5) for teacher estimates, to provide comparisons by 
teacher *s primary teaching assignment. 



To increase the number of selected teachers per school so that 
primary teacher assignment comparisons can be made (goal 5), the 
within school variance component of the total teacher variance, 
in the compromise allocation, was multiplied by the number of 
desired teacher assignments (the teacher assignments are provided 
in the teacher sample selection section under teacher sorting) . 
Appendix 2 provides more details about the allocation. 

Tables 2 and 3 provide the results from the compromise 
allocation. Table 2 provides the average number of selected 
teachers per school and table 3 provides the school allocation 
for the state by level stratum allocations. Table 4 provides the 
number of public LEAs selected for SASS. 

The final design goal was: 

6) for private schools, to provide comparisons by detailed 
affiliation. 

Table 6 provides the required sample sizes for the thirteen 
private school affiliations to satisfy goal 6. This table was 
produced by allocating at least 100 schools to each affiliation. 
If an affiliation had less than 100 schools on the frame then all 
schools in the affiliation were selected. The remainder of the 
private school sample was allocated proportional to number of 
schools in the affiliation. 

Within a state/level stratum, a raking procedure was used to 
distribute the private school allocation (table 3) to the 
thirteen affiliations. This process maintained the affiliation 
allocation in table 6, as well as the allocation in table 3. 

The private school stratification was then state by level by 
affiliation. Table 5 provides the number of schools in the 
private school list frame by affiliation/level strata. 
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Table 2. — Average number of selected teachers per school 
Public schools 

teachers/ school 
Elementary schools 4 
Secondary schools 8 
Combined ^schools 6 



Private schools 

teachers/ school 
Elementary schools 4 
Secondary schools 5 
Combined schools 3 
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Table 3. — SASS school allocation (state by school level) 
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Table 3. -- 



SASS school allocation (state by school level) 
— Continued 



Public schools 



Private schools 



State 


Elem . 


Sec . 


Comb . 


Total 


Elem. 
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Comb . 


Total 


Total 


Tennessee 


94 


50 


26 


170 


22 


6 


27 


55 


226 


Texas 


262 


224 


93 


579 


67 


10 


25 


102 


681 


Utah 


75 


28 


12 


115 


3 


1 


1 


6 


121 


Vermont 


74 


15 


4 


93 


7 


5 


6 


18 


110 


Virginia 


107 


69 


26 


203 


29 


6 


29 


64 


267 


Washington 


88 


55 


19 


163 


26 


4 


22 


51 


21 


West Virginia 


85 


33 


12 


131 


7 


2 


5 


14 


14b 


Wisconsin 


98 


62 


22 


182 


95 


9 


19 


123 


305 


Wyoming 


70 


21 


7 


99 


7 


0 


2 


9 


107 


Potal teachers 


19975 


23257 


8582 


51814 


8070 


2045 


3195 


13310 


65124 
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Table 4. — Number of sampled public LEAs 





State 


LEAs 




Total 


5592 




1 Alabama 


92 




2 Alaska 


40 




3 Arizona 


76 




4 Arkansas 


104 




5 California 


354 




6 Colorado 


64 




7 Connecticut 


87 




8 Delaware 


19 


9 


D. of Columbia 


1 




10 Florida 


53 




11 Georgia 


110 




12 Hawaii 


1 




13 Idaho 


61 




14 Illinois 


2 66 




15 Indiana 


137 




16 Iowa 


132 




17 Kansas 


108 




18 Kentucky 


99 




19 Louisiana 


66 




20 Maine 


83 




21 Maryland 


24 


22 Kassachusetts 


148 




23 Michigan 


208 




24 Minnesota 


139 




25 Mississippi 


95 




26 Missouri 


147 




27 Montana 


85 




28 Nebraska 


100 




29 Nevada 


17 


3 0 New Hampshire 


66 




31 New Jersey 


180 




32 New Mexico 


53 




3 3 New York 


294 


34 


North Carolina 


105 




3 5 North Dakota 


78 




36 Ohio 


275 




37 Oklahoma 


142 




3 8 Oregon 


90 




39 Pennsylvania 


244 




40 Rhode Island 


31 


41 


South Carolina 


69 



14 



o 2(1 

ERIC 



Table 4. — Number of sampled public LEAs — Continued 



42 South Dakota 


80 


43 Tennessee 


84 


44 Texas 


350 


4 5 Utah 


3C 


4 6 Vermont 


78 


47 Virginia 


92 


48 Washington 


95 


49 West Virginia 


57 


50 Wisconsin 


141 


51 Wyoming 


44 
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Table 5. — List frame sizes for private school affiliations 



Frame sizes 



Affiliation 




El ementarv 


Secondarv 


Combined 


Total 


Total 




14653 


2465 


7 064 


' 182 


Catholic 




7618 


1377 


184 


9179 


Friends 




26 


8 


30 


63 


Episcopal 




177 






9R7 


Jewish 




276 


110 


186 


572 


Lutheran 




1502 


81 


76 


1659 


Seventh-Day Adventist 


779 


62 


451 


1292 


Christian Schools Intl 


146 


31 


80 


257 


Amer Assoc ' 


Christ. Schools 139 


6 


901 


1046 


Exceptional 


Children 


19 


10 


124 


153 


Military Schools 


0 


17 


9 


26 


Montessori 




228 


1 


64 


293 


Independent 


Schools 


240 


257 


437 


984 


Other 




3505 


466 


4400 


8371 



Table 6. -- Sample 


sizes for private school 


affiliations 








List frame 






Affiliation 


Elementary 


Secondary 


Combined ' 


Total 


Total 


1705 


353 


910 


2968 


Catholic 


637 


131 


12 


781 


Friends 


26 


8 


30 


63 


Episcopal 


72 


20 


28 


120 


Jewish 


76 


29 


37 


142 


Lutheran 


202 


12 


'a 


222 


Seventh-Day Adventist 


125 


10 


60 


195 


Christian Schools Int" 


74 


15 


30 


118 


Amer Assoc Christ. Schools 25 


1 


150 


177 


Exceptional Children 


17 


7 


86 


110 


Military Schools 


0 


17 


9 


26 


Montessori 


97 


1 


23 


121 


Independent Schools 


45 


54 


73 


172 


Other 


309 


49 


363 


721 


additional 551 schools were allocated 


to the area 


frame sample 


(see 



frame sample section for more information) . 
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Sample Selection 
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Minimizing the School Overlap between the NAEP and NE LS Samples 



The collection period for the NAEP (National Assessment of 
Educational Progress) and NELS (National Education Longitudinal 
Study) surveys was the same as the SASS survey. To minimize 
respondent burden from NCES surveys, it was decided to minimize 
the school overlap between the three surveys. Each survey used 
the Qualit. Education Data (QED) file as a school frame. Hence, 
it was easy to match schools between the surveys. Each survey, 
however, did independent frame refinements, so the frames did not 
match exactly. Given the time constraints to do this process, the 
minimization process was done only on schools that matched on 
either the NAEP or NELS frames via the QED identification number. 
Since SASS was by far the largest survey, SASS was selected last. 
This reduced the impact of the potentially large SASS ' rlap on 
NAEP and NELS. NAEP selected first; NELS selected nex^ .nd 
unduplicated the NAEP sample from the NELS frame before selecting 
their sample. SASS selected last and minimized the overlap 
between both the NAEP and NELS samples. 

The approach taken by SASS was to minimize the school overlap in 
a fashion such that the SASS selection probability for a school 
averaged over all possible NAEP, NELS and SASS samples equaled 
the original probability of selection, if no unduplication was 
required. Taking this approach meant giving some NAEP and NELS 
sampled schools a small probability of selection. However, by 
maintaining the original probability of selection, the impact the 
unduplication process had on SASS estimates was minimized. Only 
four SASS schools overlapped with the NAEP sample and there was 
no overlap with the NELS sample. For more details on the 
minimization process see appendix 1. 
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Public School and LEA Sample Selecti on 
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Publi c Snhool Sample 



This section describes the frame, stratification, sorting and 
sample selection. The school allocation was described earlior in 
the School and Teacher Allocation section. In total, 9317 public 
schools were selected. 

The SASS school sample was selected to minimize the overlap 
between the NAEP and NELS school samples. See section. Minimizing 
the Overlap between the NAEP and NELS School Samples , for a 
description of that process. 



The public school frame was the 1986 Quality Education Data (QED) 
file. The frame included regular public schools. Bureau of Indian 
Affairs schools and Department of Defense schools. Nonre lular 
schools such as special education, vocational or technical 
schools were also included in the sample frame. Before sampling, 
duplicate schools and Department of Defense Schools outside of 
the United States were removed from the frame. Schools that only 
teach prekindergarten, kindergarten or adult education were also 
removed. There were a total of 80,384 schools on the public 
school frame. Some public schools were missing from the frame 
(see frame evaluation section) . However, the QED file was 
believed to be the best frame available at the time. 



Stratification 

Public schools had a total of 153 strata. 

The first level of stratification was State (51) — each of the 
50 States and the District of Columbia. 

Within each state there were 3 grade level strata (elementary, 
secondary, and combined school) , defined as follows: 



Frame 



Number of schools on public frame 



School type 



Number of schools 



Elementary 

Secondary 

Combined 



54,463 
18,737 
7, 184 
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Regular Schools 
Elementary Lowest grade < 6 and Highest grade < 8 

Secondary Lowest grade > 7 and Highest grade < 12 

Combined Lowest grade < 6 and Highest grade > 8 



Nonreqular School 
Nonregular schools, which include special education, vocational, 
technical, adult education (if part of in-scope school) or 
alternative/continuation grades were classified as combi^-ed 
schools. 



School Sorting 

Before the sample was selected, the schools within each stratum 
were sorted. To facilitate the calculation of LEA weights, it was 
important to keep all schools within a stratum/LEA together. To 
accomplish this, some of the sort variables' values were changed 
to keep them the same for every school within a stratum/LEA. They 
were changed in the following manner: 

1) All schools within a stratum/LEA had the first three 
digits of the ZIP code set equal to the ZIP code of the 
first school in the stratum/LEA. 

2) All schools within a stratum/LEA had the urbanicity code 
changed to the urbanicity code most prevalent among the 
first 11 schools within the stratum/LEA. If there were 
less than 11 school in the LEA, the most prevalent value 
of urbanicity was used. If there was a tie the lower 
value was used. 

After tl.ese fields were changed the schools within a stratum were 
sorted by the following variables: 

1) LEA urbanicity — 0 - unclassified, 1 - urban, 

2 - suburban and 3 - rural ; 

2) LEA percent — 1 - (0-5%), 2 - (6-20%), 

minority 3 - (21-50%) , and 4 - (51% or more) ; 

3) LEA Zip code — The first three digits were used; 

4) LEA ID number; 

5) Highest grade in school; 

6) School enrollment; and 
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7) PIN number — The PIN number is a unique number assigned 
by QED which identifies the school. 

Sample Selection 

Within each stratum, schools were systematically selected using a 
probability proportionate to size algorithm. The measure of size 
used was the square root of the number of teachers in the school 
as reported on the QED file. Any school with a measure of size 
larger than the sampling interval was excluded from the 
probability sampling process and included in the sample with 
certainty. 
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LEA vSample for Public Schools 



During the initial c'r^.ign development of the SASS, selecting the 
LEAs first and then selecting schools within LEAs was considered. 
It was hypothesized that doing this would reduce the reliability 
of both school and teacher estimates. Simulations done on the 
reliability of LEA estimates when the LEAs were selected first 
confirmed this. The simulations also showed that selecting 
schools "first" would produce reasonably accurate LEA estimates. 
For these reasons' the SASS sample selected schools first. 

Hence, the LEA sample consists of the set of LEAs that were 
associated with the SASS public school sample. This provides the 
linkage between the LEA and the school. Each Bureau of Indian 
Affairs school and Department of Defense school was defined to be 
an LEA. This portion of the I-EA sample represented the set of 
LEAs associated with schools. 

Some LEAs were not associated with schools. Such LEAs may hire 
teachers that teach in schools from other LEAs. For SASS to 
represent such LEAs, a sample of these LEAs was also selected. 
The frame for this sample consisted of all LEAs on the '86 QED 
file that were not associated with schools. 1,077 LEAs were on 
this frame. 70 LEAs were selected and only 8 of the 70 sampled 
LEAs were actually in-scope (i.e., reported hiring teachers in 
SASS) . 

The frame was stratified into two strata: the first stratum 
consisted of LEAs not associated with schools that reported at 
least one teacher on the QED file; the second stratum consisted 
of LEAs not associa'-ed with schools that reported zero teachers 
on the QED file. 

For the first stratum, a 1 in 10 sample was taken. The sample was 
selected using a systematic probability proportionate to size 
algorithm. The measure of size was the square root of the QED 
number of teachers. The sort variables were: 

1) State (51) — one for each state and the District of 

Columbia ; 

2) Urbanicity (3) — 1 - urban, 2 - suburban, and 

3 - rural; 

3) Percent minority (4) i - (0-5%), 2 - (6-20%), 

3 - (21-50%), and 4 - (51% or 
more) ; 

4) First three digits of Zip code; 
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and 5) LEA ID. 

For the second stratum, a 1 in 2 0 sample was taken. The sample 
was selected using a systematic equal probability algorithm. The 
sort variables were: 

1) State (51) — one for each state and the District of 

Columbia ; 

2) Urbanicity (3) — 1 - urban, 2 - suburban, and 3 - rural; 

3) Percent minority (4) — 1 - (0-5%), 2 - (6-20%), 

3 - (21-50%), and 4 - (51% or 
more) ; 

4) First three digits of Zip code; 
and 5) LEA ID. 

Delaware. Nevada and West Virginia LEAs 

For each state, a simulation study was done to assess the 
reliability of SASS LEA estimates. It showed that standard errors 
from Delaware, Nevada and West Virginia were very high relative 
to the LEA sampling rate (i.e., CVs larger than 2 0 percent with 
90 percent of LEAs in sample) . Since almost all LEAs were 
selected in those states, it was decided to include the 
nonselected LEAs in the LEA sample. Hence, all LEAs in these 
states were selected with probability one. 
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Private School Sample 



This section describes the frame, stratification, sorting and 
private school selection. The school allocation is described in 
the School and Teacher Allocation section. 

Frames 

3,513 private schools were selected using a dual frame approach. 
A list frame was the primary private school frame and an area 
frame was used to find schools missing from the list frame and 
thereby compensate for the coverage problems of the list frame. 

List Frame 

The list frame used for private schools was a supplemented 1986 
QED file. Various private school associations were asked to 
supply lists of their schools. Seventeen such lists were 
received. These lists were matched with the QED list and any 
association list school not found on the QED file was added to 
the frame. Before sampling, duplicate schools were excluded from 
the frame. Schools that only teach prekindergarten , kindergarten 
or adult education were also removed. The list frame consisted of 
22,600 schools from the QED file and 1,586 schools added on from 
the association lists. 



Area Frame 

The area frame sample PSUs were originally selected by Westat for 
the 1983 Private School Survey. They used a dual frame design 
similar to the SASS design. The area frame consisted of 2,497 
primary sampling units (PSUs) . Each PSU consisted of a single 
county or independent city or cluster of such geographically 
contiguous areas defined so that each PSU had a minimum 
population of 10,000 according to the 1980 Census of Population. 
To avoid having PSUs covering too large a geographic area some 
PSUs had less than 10,000 in population. Counties of Alaska were 
excluded. 

Eight PSUs with populations greater than 1,700,000 were included 
in the sample with certainty. 

The strata were: a) census region (4 levels), b) metro/nonmetro 
status (2 levels) and c) whether the PSU's private school 
enrollment exceed^ed the median enrollment of the other PSUs in 
the census region/metro status strata (2 levels) . 

A minimum of four PSUs were allocated to each of the 16 strata 
(64 PSUs) . 28 additional PSUs were allocated to the 16 strata to 
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more nearly approximate a uniform fraction of PSUs in the sample 
from each stratum. 

The PSUs were selected as a systematic sample with probability 
proportionate to the square root of the 1980 PSU population. A 
total of 100 PSUs were in sample. Due to budget constraints 
during the '83 Private School Survey, the 100 PSUs were 
sutasampled to 75 PSUs with a minimum of 3 selected PSUs per 
stratum. These 75 PSUs were used in the '88 SASS survey. 

Area Sample School Frame Building 

Within each of the 75 Westat PSUs, the Census Bureau attempted to 
find all eligible private schools (i.e., nonpublic schools 
providing the following: instruction for any grades 1-12, 
instruction not provided exclusively in the home, a school year 
at least 160 days long and a normal school day at least 4 hours 
long) . An area canvas was not attempted. However, regional staff 
were used to create the frame using such sources as: yellow 
pages, non-roman catholic religious institutions, local education 
agencies, chamber of commerce, local government offices, 
commercial milk companies and commercial real estate offices. 
Roman Catholic religious institutions were not contacted because 
QED calls each Catholic diocese during its annual list update. 
Once these lists were constructed, they were matched with the 
updated QED file. Schools that did not match the QED list were 
contacted to make sure they were eligible schools. 746 eligible 
and 250 unable-to-contact schools comprised the area school 
frame. The 746 eligible and 250 unable-to-contact school weight 
up to respectively 6,909 and 753 schools nationally. 



Frame Summary 

The total number of schools from the list and area school frames 
are provided in table 7 below. 
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Table 7. — Private school frames 

Unweighted count Weighted count 

List frame 

Total 24,186 24,186 

QED list 22,600 22,600 

Association 1,586 1,586 
add-ons 

Area frame 

Total 996 7,662 

Eligibles 746 6,909 

No contacts 250 753 
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Private School List Sample 



Stratification 

For private schools, the frame was partitioned into an initial 
set of 1,989 cells. Only cells with schools in them comprised the 
sampling strat-- . Some of the initial cells didn't have any 
schools. 

The first level of stratification was state (51) — 1 for each 
state and the District of Columbia. 

Within each state, school affiliation (13) was the next level of 
stratification : 



1) 


Catholic schools 


2) 


Friends schools 


3) 


Episcopal schools 


4) 


Jewish schools 


5) 


Lutheran schools 


6) 


Seventh-Day Adventist schools 


7) 


Christian Schools International 


8) 


Association of Christian Schools 


9) 


Association of Private Schools for Exceptional Children 


10) 


Association of Military Colleges and Schools of the U.S 


11) 


American Montessori Society Schools 


12) 


National Association of Independent Schools 


13) 


All Else 



Within each state/affiliation, schools were stratified by school 
level (elementary, secondary, and combined schools) . Tha 
definitions are provided below: 
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liar Schools 

Elementary Lowest grade < 6 and Highest grade < 8 
Secondary Lowest grade > 7 and Highest grade < 12 
Combined Lowest grade < 6 and Highest grade > 8 



Nonregular School 
Nonregular schools, which include special education, vocational, 
technical, adult education (if part of in-scope school) or 
alternative/continuation grades were classified as combined 
schools . 



Due to the raking procedure, described in the school allocation 
section, some strata did not have any schools designated in the 
sample; in such cases, strata were collapsed with other strata 
until all strata had a least one designated sample school. 

School Sorting 

Within each stratum, sorting took place on the following 
variables: 

1) Urbanicity — 0 unclassified, 1 urban, 2 suburban, 3 

rural and 4 affiliation add-on with no 
classification ; 

2) Zip code — The first two digits were used; 

3) Highest grade in the school; 

4) Enrollment; 

and 5) PIN number — The PIN number is a unique number assigned 

by QED which identifies the school. 

Sample Selection 

Within each stratum, schools were systematically selected using a 
probability proportionate to size algorithm. The measure of size 
used was the square root of the QED number of teachers in the 
school. Any school with a measure of size larger than the 
sampling interval was excluded from the probability sampling 
process and included in sample with certainty. 
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Area Frame Sample 



Allocation 

551 private schools were selected from the area sample school 
frame. In most states, all area frame schools were in sample. 
However, in six states: California, Florida, Illinois, Montana, 
Nevada and Texas, the number of area frame schools exceeded 30 
percent of the state's total allocation. To maintain a reasonable 
balance between the numbers of list and area frame sample cases, 
in these six states the area frame schools were subsampled to 3 0 
percent of the state's total allocated sample. After a state 
sample size was determined for the area frame, sample was 
allocated to PSUs so as to maximize the "self-weightedness" of 
the school sample, (i.e., minimize the range of the weights). The 
sample sizes for the area sample school frame are provided in 
table 8. 

Table 8. — Sample sizes for the area sample school frame 



Sampling rates 



Elementary 



Secondary 



California 


46 


of 


12 3 I 


9 of 13 


26 


Of 


45 I 


37 of 198 


Florida 


19 


of 


56 


3 of 7 


22 


of 


26 


4 of 8 


Illinois 


40 


of 


74 


7 of 8 




of 


45 


6 of 13 


Montana 


1 


of 


1 


1 Of 1 




of 


5 


1 of 2 


Nevada 


4 


of 


5 












Texas 


22 


of 


49 ! 


3 of 7 


! 8 


of 


23 


1 1 of 6 



Combined | Unable to 
contact 



All area frame schools in the other states were selected with 
certainty. 



— means there were no cases in the frame to sample. 
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For the six states, the following sampling scheme was 
implemented. 



Stratification 

Schools were stratified by the following variables: 

1) State; 

2 ) PSU ; 

3) Two strata 

a. Eligible schools - Schools that were contacted and 

verified in-scope; 

and 

b. Unable-to-contact schools - Schools who could not be 

contacted and therefore it 
was not possible to verify 
in-scope status . Sort 
variables (grade level, 
affiliation and 
enrollment) could not be 
collected for these 
schools ; and 

4) School level - Eligible schools were stratified by grade 

level (elementary, secondary, and 
combined) . In some cases, when the grade 
level was unknown, it was imputed. 

Since the school level was unknown for 
the unable-to-contacts, they were not 
stratified by this variable. 

Sort Variables 

Eligible schools were sorted using the following variables: 

1) Affiliation - Catholic, other religious, and 

nonsectarian. In sonie cases, when the 
affiliation was unknown, it was imputcid 
based on the name; 

2) Enrollment; 

and 3) Alphabetical order of name. 
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Unable-to-contact schools were sorted by alphabetical order of 
name. 



Measure of Size 

For eligible schools, the measure of size was the square root of 
the number of reported teachers. For unable-to-contact schools, 
this measure of size was unknown so they were assigned a measure 
of size equal to le. 

Sample Selection 

Within each stratum, eligible schools were systematically 
selected using a probability proportionate to size algorithm. The 
measure of size used was described above. Any school with a 
measure of size larger than the sampling interval was excluded 
from the probability sampling process and included in sample with 
certainty. 



35 



38 



Public and Private Teacher Sample Selection 



This section describes the frame, stratification, within school 
allocation, sorting, and the selection for the public and private 
teacher sample. 

Selecting the teacher sample involved the following steps. First, 
the selected schools were asked to provide teacher lists for 
their schools. From the lists, 56,242 public and 11,529 private 
teachers were selected. Teachers could be selected in one of two 
ways: 

1) Most teachers were selected as part of the basic teacher 
sample. These teachers were allocated to approximately 
equalize the teacher weights within a school stratum; 

or 

2) 2447 bilingual/ESL teachers were selected independently 
from the basic sample. This sample was designed to 
produced reliable bilingual/ESL teacher estimates in 
California, Texas, Florida, Illinois, New York and all 
other states. 



The public and private teacher samples will be described together 
because they were selected using identical methodology. The only 
differences were in the rate new teachers were oversampled and 
the average number of teachers selected within a school. 

The details of the teacher selection are provided below. 

Teacher Frame 

Each selected school was asked to provide a list of their 
teachers. For each teacher on the list, the following was to be 
specified: 

1) New/experienced - Teachers in their first, second or 

third year of teaching should be 
classified as a new teacher. 

2) Bilingual/ESL - Teachers who use NATIVE LANGUAGE to 

instruct students with limited English 
proficiency (bilingual) 

or 

teachers providing students with limited 
English proficiency with intensive 
instruction in English (English as a 
Second Language) . 
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Field of Teaching - 



Elementary teachers should be 
classified as: general elementary, 
special education or other. 



Secondary teachers depending on 
primary teaching subject taught 
should be classified as: math, 
science, English, social studies, 
vocational education or other. 

The above information for each teacher in a selected SASS scho 
comprises the school teacher frame. 

Twelve percent of the private schools and four percent of the 
public schools did not provide teacher lists. For these school 
no teachers were selected. A factor in the teacher weighting i 
used to adjust the weights to reflect the fact that some schoo 
did not provide teacher lists. 
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Basic Teacher Sample 



Teacher Stratification 

Within each selected school, teachers were stratified by their 
new/experienced classification. This was done to allow for the 
possibility of oversampling these groups in order to ensure 
sufficient sample for analytical purposes. 

Within School Teacher Allocation 

For public schools, it was decided that new/experienced teachers 
did not require oversampling. Therefore, teachers were allocated 
to the new and experienced categories proportional to their 
numbers in the school. However, for private teachers, to ensure 
that there would be enough new teachers in the Teacher Followup 
Survey (TFS) , it was decided to oversample new teachers. 

Before teachers were allocated to the nev//experienced strata, the 
school was first allocated an overall number of teachers to be 
selected. This overall sample size was chosen so as to equalize 
the teacher weights within the school stratification (state/level 
and state/affiliation/level for public and private sectors, 
respectively), assuming no further teacher stratification. 

Table 9 provides the average number of teachers to be selected 
within a school. 

Table 9. — Average selected teachers per school 

School Level 
Elementary Secondary Combined 

Public Schools 4 8 6 

Private Schools 4 5 3 



Given the numbers in table 9, the overall teacher sample size was 
chosen to equalize the teacher weights within a school stratum. 
Since the school sample was selected proportional to the square 
root of the number of teachers in the school, an equally weighted 
teacher sample within a school stratum was obtained by selecting 
t. teachers in school i. 
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t. = W,*T,(C/Y) 



1 



Where: W. is the school weight for school i (the 
inverse of the school selection 
probability) . 



1 



T; is the number of teachers in school i, 
as reported on the teacher li:t form. 

is the average number of teachers 
selectr ' r»er school (See table 9) . 

is the t^xmi^xe average of the school's 
weighted measure of size over all 
schools in the school stratum. The 
weighted measure of size is defined 
below. 

For noncertainty schools, the weighted 
measure of size equals school sampling 
interval times the square root of the 
QED number of teachers in school. The 
measure of size for certainty schools 
is the QED number of teachers in the 
school . 



To make sure a school wasn't overburdened, the maximum number of 
teachers per school was set at twice the average number of 
teachers per school from table 9. Each school was asked for at 
least one teacher. 

65,124 teachers were designated for selection, while 65,327 were 
actually selected. This slight difference was due to the fact 
that in allocating the sample, Y was based on QED teacher counts 
instead of reported teacher counts from the school. This caused 
the overall average number of teachers per school to be slightly 
different than the target numbers in table 9. 

Given the overall school allocation of teachers, t., teachers 
were allocated to the new/experienced strata, t^,. and t^. 
respectively, in the following manner. 



tni=(A*T,>t.)/(T,,+A*T„,) 

and 

toi=(Tei*t.)/(T^^+A*T^J 



where: A is the oversampling factor (see table 
10) . 

T . is the number of new teachers in 
school 1. 

Tg. is the number of experienced teachers 
in school i. 



Table 10. — Oversampling factor for new teachers (A) 

A (oversampling factor) 

Pablic teachers 1.0 
Private teachers 1.6 



The oversampling in private schools was done to ensure the 
ability to analyze new private school teachers for both the SASS 
and the TFS surveys. The expected public new teacher sample size 
was large enough to meet the NCES ' s analysis plans without 
oversampling. 

Teacher Sorting 

Within each new and experienced teacher stratum, teachers were 
sorted by primary field of teaching. Elementary teachers were 
sorted by general elementary, special education or other teaching 
assignment. Secondary teachers were sorted by math, science, 
English, social studies, vocational education or other teaching 
assignment. When combined schools had both elementary and 
secondary teachers, the teachers were sorted by grade 
level/primary field of teaching. This was done to assure a good 
distribution of teachers by field of teaching. 

Teacher Selection 

Within each school and teacher stratum, teachers were selected 
systematically with equal probability. 
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Bilingual /ESL Teacher Supplemental Sample 



Independent of the teacher selection process above (the basic 
sample) , a sample' of bilinguai/ESL teachers was selected from the 
teacher lists from both public and private SASS school samples, 
using the bilingual/ESL code. This supplemental sample was 
designed to provide estimates for California, Texas, Florida, 
Illinois, New York, and the rest of the United States. 

Bilingual Supplement Stratification 

There were six strata: 

1) California 

2) ^ Texas 

3) Florida 

4) Illinois 

5) New York 

6) All other states 

Bilingual Sample Allocation 

The goal of the bilingual supplement was to be able to analyze 
bilingual teachers in the strata stated above. To achieve this 
goal, it was decided to select approximately 500 teachers in each 
strata. Since accurate counts of bilingual/ESL teachers were not 
available, determining the expected number of bilingual/ESL 
teachers in the selected SASS schools was very difficult. Since 
teachers were selected on a flow basis as the lists came in from 
the field, an estimate could not be made using SASS. To minimize 
this problem, conservative sampling rates were used so that more 
than the required number of teachers would be selected. After a 
specified time, estimates of final sample sizes were made and a 
sampling reduction rate was determined and applied to provide the 
desired sample sizes. 

The sample size within each school was chosen to be proportional 
to the weighted number of bilingual teachers in the school (i.e., 
proportional to the inverse of the school selection probability 
times the number of bilingual teachers in the school) . 

No school was asked to provide a sample of more than 2 0 teachers. 
If the sum of the basic and bilingual teacher samples was greater 
than 20 in any school, the bilingual sample was systematically 
reduced to make the total teacher sample equal to 20. Table 11 
provides the actual number of selected bilingual teachers in the 
final sample. 
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Table 11. — Number of bilingual teachers in the supplement 



State 



Number of teachers 



Total 



2,447 



Illinois 
Florida 



195 
190 
513 
563 
468 



All SASS bilingual teachers selected 
All SASS bilingual teachers selected 



California 
Texas 
New York 



Rest of U.S', 518 



Some state allocations were greater than 500 because the sample 
reduction rates were computed due to timing constraints before 
all teachers were selected. 

Sort Variables 

Within each school, teachers were sorted by the new/experienced 
category by primary field of teaching. Elementary teachers were 
sorted by general elementary, special education or other teaching 
assignment. Secondary teachers were sorted by math, science, 
English, social studies, vocational education or other teaching 
assignment. This was done to assure a good distribution oi 
teachers by field of teaching. 

Teacher Selection 

Teachers were selected systematically with equal probability 
among those schools having bilingual/ESL teachers. 
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Teacher Sampling for Telephone Followup 



Given the size of the teacher sample (67,771 basic and 2,447 
bilingual/ESL teachers) and the fact that this was the first time 
SASS was fielded, the Census Bureau was not sure there was enough 
time to do telephone followups for all teachers not reporting by 
mail. To handle this situation, the teachers selected above were 
systematically placed into 100 approximately equal sized groups. 
Once the mail collection was closed out, this provided the 
capability of producing different sized samples of non-mail 
return teachers for telephone followup collection. Hopefully, all 
non-mail return teachers would be followed up. However, if time 
or money constraints became critical, this provided the 
capability to reduce the number of followup cases. In fact, 57 
percent of the non-mail return teachers (i.e., 23,365 teachers), 
were selected for telepnone followup. Therefore, 57 of the 100 
groups were randomly chosen for telephone followup. 
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Weighting 

This section describes the weighting processes for the SASS 
samples. For each questionnaire, the formula for the weight will 
be presented, along with a brief description of each component. 
When computations are done within cells, such as nonresponse 
adjustments, the cells will be described. Sometimes a cell did 
not have enough data to produce a reliable estimate; in such 
cases, cells were collapsed. The least important variables were 
always collapsed first. The collapse criteria are also described. 

First the school weight will be described. Since the public and 
private school weights have the same structure, they will be 
presented together. They differ only ^n the definition of the 
cells used to compute the nonresponse adjustment and the 
first-stage ratio adjustment factors. These cells will be 
described se^- rately within the school weight section. 

Since the public and private administrator weights, as well as 
the private teacher demand and shortage wrj.7hts are similar to 
the school weights, they will be described next. 

In the third section, the public teacher demand and shortage 
weights will be described. 

In the last weighting section, the teacher weights will be 
described. Since the public and private weights have the same 
scructure, they will be presented together. They differ only in 
the definition of the cells used to compute the various weighting 
factors. These cells will be described separately within the 
school weight section. 
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School Weight 



The final weight for the public and private school data is: 

(Basic Weight) X (Sampling Adjustment Factor) X (School 
Noninterview Factor) X (Frame Ratio Adjustment Factor) 

Where: Basic Weight is the inverse of the probability of 

selection of the school. 



Sampling Adjustment 
Factor 



is an adjustment that 
accounts for unusual 
circumstances that affect 
school's probability of 
selection, such as merger, 
split or duplication. 



the 



School Noninterview 
Adjustment 



is an adjustment that accounts 
for total school nonresponse. 
It's the weighted (basic 
weight*sampling adjustment 
factor) ratio of total 
eligible in-scope schools to 
the total responding in-scope 
schools within cells. 



Frame Ratio 

Adjustment Factor This factor adjusts the sample 

estimates to known frame totals. 
For public schools, it is equal to 
the ratio of the total number of 
SASS frame noncertainty schools to 
the weighted sample estimate of 
total number of noncertainty 
schools in the frame. For private 
schools, the adjustment is the 
same, except for the area frame. 
/ For the area frame, instead of 
SASS frame totals, weighted (PSU 
weight) area frame totals are used 
in the numerator. This ratio is 
calculated within cells. 
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Cells 



The school noninterview and frame ratio adjustments are computed 
within cells. 

Public 

For public schools, the noninterview adjustment cells were: state 
by school level by enrollment size class by urbanicity. If the 
factor was less than 1.5 and there were at least 15 schools in 
the cell, no collapsing was done. Otherwise, cells were collapsed 
(enrollment size first, urbanicity second and level third) . The 
frame ratio adjustment cells were: state by grade level by 
urbanicity. The results of the QED factors were reviewed by 
Census Bureau staff. If collapsing was necessary (e.g., where a 
cell contains QED schools, but no SASS schools), urbanicity was 
collapsed first and grade level was collapsed last. 

Private 

For private list frame schools, the noninterview adjustment cells 
were: census region by affiliation (Catholic, other religious and 
nonsectarian) by grade level. If the factor was less than 1.5 and 
there were at least 15 schools in the cell, no collapsing was 
done. If collapsing was done, urbanicity was collapsed first, 
grade level second and affiliation last. The frame ratio factor 
cells were: grade level by urbanicity by affiliation (thirteen 
sampling strata affiliations) . The results of the frame factors 
were reviewed by Census Bureau staff. If collapsing was necessary 
(e.g., where a cell contains QED schools, but no SASS schools), 
urbanicity was collapsed first, grade level second and 
affiliation last. 

For private area frame schools, the noninterview adjustment cells 
were: census region by affiliation (3 levels) by grade level. 
Within census region, the unable-to-contact schools were another 
cell by themselves. If the factor was less thai) 1.5 and there 
were at least 15 schools in the cell, no collapsing was 
necessary. If collapsing was necessary, grade level was collapsed 
first, and affiliation was collapsed last. The frame ratio factor 
cells were: grade level by urbanicity. The results of the frame 
factors were reviewed by Census Bureau staff. If collapsing was 
necessary (e.g., where a cell contains QED schools, but no SASS 
schools) , urbanicity was collapsed first and grade level was 
collapsed last. 
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Administrator and Private Teacher Demand and Shortage 



Questionnaires 



The public and private administrator weighting, as well as the 
private teacher demand and shortage weighting was done the same 
way as the school questionnaire weighting described above. Since 
the respondents for each of th^^^se surveys were different, the 
weighting process was done separately for each questionnaire. The 
sum of the administrator weights may not equal the sum of the 
school weights because some schools do not have administrators. 
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Teacher Demand and Shortage for Public Distri cts 



The final weight for the district data is: 

(Basic Weight) X (sampling Adjustment Factor) X (LEA 
Noninterview Factor) X (Frame Ratio Adjustment Factor) 

Where: Basic Weight is the inverse of the probability of 

selection of the LEA. See section 
below for more details. 

Sampling Adjustment 

Factor is an adjustment that 

accounts for unusual 
circumstances that affect the 
LEA'S probability of 
selection, such as merger, 
split or duplication. 

Noninterview Adjustment is an adjustment that 

accounts for total LEA 
nonresponse. It's the 
weighted (basic 
weight* sampling adjustment 
factor) ratio of total 
eligible in-scope LEAs to 
the total responding 
in-scope LEAs, computed 
within cells. 



This factor adjusts the sample 
estimates to known QED totals. 
It's the ratio of the total 
number of QED noncertainty LEAs 
to the weighted sample estimate 
of the total number of 
noncertainty LEAs in the frame, 
computed within cells. 



Cells 

The noninterview and frame ratio adjustments are computed within 
cells . 

The noninterview adjustment cells were: state by LEA enrollment 
size class by percent minority enrollment size class. If the 
factor was less than 1.5 and there were at least 10 LEAs in the 
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Frame Ratio 
Adjustment Factor 



cell no collapsing was done. Otherwise, cells were collapsed 
(percent minority enrollment first and enrollment last) . The 
frame ratio adjustment cells were: state by LEA enrollment size 
class level by urbanicity. The results of the frame factors were 
reviewed by Census Bureau staff. If collapsing was necessary 
(e.g., where a cell contains QED LEAs, but no SASS LEAs) , 
urbanicity was collapsed first and LEA enrollment was collapsed 
last. 

LEA Basic Weights 

Given the complexity of the sampling scheme, LEA basic weights 
are not straight forward. There are three situations that need 
discussion: LEAs with school, LEAs without schools and LEAs in 
Delaware, Nevada and West Virginia which have two sets of basic 
weights . 



LEAs with Schools 



The LEA sample was not selected directly through the LEA. 
Instead, the LEAs were selected through the school (i.e., the 
LEAs associated with the school sample comprised the LEA sample) . 
The basic weight, therefore, is m^ire complicated then normal. 

The basic weight for LEA k is the inverse of the probability of 
selection (P|^(sel) ) : 

P,^(sel)=l-(l-P^(Elem) ) X (l-P,^(Sec)) X ( 1-P,^ (Comb) ) 

where: P|^(Elem) is the probability of selecting LEA k in 

the elementary school stratum. This 
equals the sum of the school selection 
probabilities for the elementary schools 
in LEA k. If the sum is greater than one 
then Pj^(Elem) is set equal to one. 

P,^(Sec) is the probability of selecting LEA k in 
the secondary school stratum. This 
equals the sum of the school selection 
probabilities for the secondary schools 
in LEA k. If the sum is greater than one 
then Pj^(Sec) is set equal to one. 

Pj^(Comb) is the probability of selecting LEA k in 
the combined school stratum. This equals 
the sum of the school selection 
probabilities for the combined schools 
in LEA k. If the sum is greater than one 
then Pj^(Comb) is set equal to one. 
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LEAs Without Schools 



The basic weight for LEAs without schools was computed using the 
usual methodology for a single stage probability proportionate to 
size sample. 

LEA Basic Weights for Delaware. Nevada and West Virgin ia 

After the school selection in Delaware, Nevada and West Virginia, 
most LEAs were in the sample. Simulations showed that these 
states' LEA estimates would have a high variance in spite of most 
LEAs being in sample. To resolve this situation, all LEAs not 
selected in these three stages were added-on to the sample. The 
basic weights for all LEAs in these states were therefore equal 
to one. 

No schools, administrators or teachers were selected for schools 
that comprise these add-on LEAs. If an LEA estimate requires such 
data the basic weights equaled to one are not appropriate. As an 
example, say someone wants to know how many LEAs have schools 
with enrollments less Lhan 100 students and suppose there are m 
total LEAs. This question can only be answered for the LEAs which 
have schools selected, say there were n such LEAs. Using the 
basic weights equal to 1 would produce an estimate equaled to n, 
if all LEAs have at least one such school. The other m-n LEAs 
would be missing from the estimate. 

To handle LEA estimates requiring school, administrator or 
teacher data a second set of weights is available. The second set 
of weights uses the basic weight derived through the school 
selection described in the formula above. With these weights, the 
add-on schools have weight zero and are being represented by one 
of the n schools. Hence, data from the add-on LEAs are not 
required. 

Summarizing, when LEA estimates are produced solely using LEA 
data then the first set of weights, with basic weight equal to 1, 
are appropriate. However, if LEA estimates require administrator, 
school or teacher data, then the second set of weights should be 
used . 
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Teacher Questionnaire 



The final weight for public and private teachers is: 

(Basic Weight) X (Sampling Adjustment Factor) X (School 
Noninterview Factor) X (Bilingual Oversampling Factor) X (Late 
Mail Return Adjustment Factor) X (Teacher-Within-School 
Noninterview Adjustment Factor) X (Frame Ratio Adjustment 
Factor) 

Where: Basic Weight is the inverse of the probability of 

selection of the teacher in either the 
basic SASS sample or the bilingual/ESL 
supplement . If a teacher is selected 
in both samples then the basic SASS 
sample weight is usedo 



Sampling Adjustment 
Factor 



is an adjustment that accounts 
for unusual circumstances that 
affect the schools probability 
of selection, such as merger, 
split or duplication. It's the 
same factor used in the school 
weight • 



School Noninterview 
Adjustment 



is an adjustment that 
accounts for schools that did 
not have teachers selected 
because tea'^her lists v;ere 
not provided by the school. 
It's the weighted (school 
basic weight*school sampling 
adjustment factor) ratio of 
total eligible in-scope 
schools to the total in-scope 
schools providing teacher 
lists, computed within cells. 



Bilingual Oversampling 
Factor 



is an adjustment that 
accounts for the fact that 
bilingual teachers have two 
chances of selection. The 
adjustment is the inverse 
of the probability of 
selection of a teacher 
being selected in either 
basic or bilingual samples 
divided by the basic 
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weight. 



Late Mail Return 
Adjustment Factor 



is an adjustment that 
accounts for the fact that 
some sampled teachers were 
not selected for nonresponse 
followup. See below for more 
details • 



Teacher-wi thin- school 
noninterview adjustment 
factor 



Frame Ratio 
Adjustment Factor 



is an adjustment that 
accounts for sampled 
teachers that did not 
respond to the survey. 
It's the weighted (product 
of all previously defined 
components) ratio of the 
total eligible teachers to 
the total eligible 
responding teachers , 
computed within cells. 

This factor adjusts the sample 
estimates to known frame 
totals cf number of schools. 
These fa :tors are the same 
used in the school weight. See 
that section for more 
information . 



Late Mail Return Factor 

As stated before, not all non-mail return teachers were eligible 
for a telephone followup. The late mail return factor reflects 
this in the weight. The factor equals one for teachers responding 
by mail. For teachers not responding by mail by the close out 
date, and selected for telephone followup, the inverse of the 
probability of selection for telephone followup (1.75) is the 
late mail return factor. Some teachers responded by mail, even 
though they responded after the mail close-out date and were not 
selected for followup. Using the 1.75 late mail return factor, 
these teachers are being represented by telephone followup 
1-rachers and should be excluded from the sample. To make use of 
this data and to avoid a double counting of these mail return 
teachers, an appropriate proportion of the telephone followup 
teachers were randomly chosen not to get the late mail return 
factor. 
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The school noninterview and the teacher within-school 
noninterview adjustments are computed within cells. The frame 
ratio adjustments are the same as those used in the school 
weight. The cells for the frame adjustments are described in the 
school weight section. 

Public 

For public schools, the school noninterview adjustment cells 
were: state by school level by enrollment size class by 
urbanicity. If the fac or was less than 1.5 and there were at 
least 15 schools in the cell no collapsing was done. Otherwise, 
cells were collapsed (enrollment first, urbanicity second and 
level third) . The teacher within-school noninterview adjustment 
cells were: state by field of teaching by experience level (new 
vs. experienced teacher) by school urbanicity. If the factor was 
less than 1.5 and there were at least 15 teachers in the cell, no 
collapsing was done. Otherwise, cells were collapsed (urbanicity 
first, experience level second and field of teaching third) . 

Private 

For private list frame schools, the noninterview adjustment cells 
were: census region by affiliation (Catholic, other religious and 
nonsectarian) by grade level. If the factor was less than 1.5 and 
there were at least 15 schools in the cell, no collapsing was 
done. If collapsing was done, urbanicity was collapsed first, 
grade level second and affiliation last. The teacher 
within-school noninterview adjustment cells were: census region 
by affiliation (three levels) by field of teaching by experience 
level by urbanicity. If the factor was less than 1.5 and there 
were at least 15 teachers in the cell, no collapsing was done. If 
collapsing was done, urbanicity was collapsed first, teaching 
experience was collapsed second, field of teaching was collapsed 
third and affiliation was collapsed last. 

For private area frame schools, the school noninterview 
adjustment cells were: census region by affiliation (three 
levels) by grade .evel. Within census region, the unable-to- 
contact schools Vvere another cell by themselves. If the factor 
was less than 1.5 and there were at least 15 schools in the cell, 
no collapsing was done. If collapsing was done, grade level was 
collapsed first, and affiliation was collapsed last. The teacher 
within-school noninterview adjustment cells were: census region 
by affiliation (three levels) by field of teaching. Teachers from 
unable-to-contact schools were treated like another affiliation 
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in the cells. If the factor was less than 1.5 and there were 
least 15 teachers in the cell, no collapsing was done. If 
collapsing was done, field of teaching was collapsed first, 
affiliation was collapsed last. 
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Item Response Rates and Imputation 



The item response rates (e.g., number sampled units responding to 
an item divided by the number of responding oampled units) for 
the SASS surveys ranged from 11 percent to 100 percent. Tables 12 
and 13 provide a brief summary of the item response rates. 

Imputations were calculated for most missing items on the teacher 
demand and shortage and school files. Such imputations are 
flagged so that an analyst may exclude them, if desired. A 
sequential hot deck methodology was used for the imputation. This 
"nearest neighbor" approach matches the nonrespondent school or 
district with the most similar respondent in the same stratum. 
The variables used to match within a stratum were urbanicity, 
percent minority and enrollment. A response from a particular 
item was then imputed in one of two ways: the matched school or 
district's response was directly assigned to the nonresponse or 
it was used as an adjustment factor with other data the 
nonrespondent reported . 

On the public school file, all items were imputed. On the private 
school file items 7 and 35 were not imputed. On both the public 
and private teacher demand and shortage file items 3, 11, 12, 13 
and 28 were not imputed. A copy of an individual questionnaire 
can be obtained by looking at a NCES survey report or tape 
documentation from that questionnaire or requesting a copy from 
the NCES. 

Due to time constraints, the administrator and teacher files do 
not contain imputations for missing items. Instead, a missing 
data code is used to indicate missing items. At some later time 
when resources permit, new administrator and teacher files will 
be issued that co^ntain imputed values. 

Because the administrator and teacher files were not imputed, 
care must be taken when looking at estimates. Totals will be 
underestimates because missing values will be assumed to have a 
zero value. For averages, missing values will assume the average 
value of the variable of interest. This may represent an 
underestimate or overestimate. 
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Table 12 . Item response rates 



Survey 



Range of item 
response rates 



Percent of items 
with response 
of 90% or more 



Percent of 
items with 
a response less 
than 75% 



Teacher 
Demand and 
Shortage 
Survey 

Public 

Private 



40-100% 
16-100% 



74% 
70% 



12% 
18% 



Administrator 
Survey 

Public 

Private 



70-100% 
72-100% 



School Survey 

Public 43-100% 

Private 11-100% 



86% 
89% 

64% 
56% 



2% 
2% 

11% 
8% 



Teacher Survey 

Public 64-100% 

Private 60-100% 



90% 



89% 



1% 
1% 



The item response rates in this table are unweighted and do not 
reflect additional response loss due to complete questionnaire 
refusal . 
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Table 13. — Items with response rates less than 75 percent 



Teacher Demand 
and Shortage 



Public 
Private 



9b-9d, 9f, lOb-lOd, lOf and lib 
3, 9b-9d, 9f, lOb-lOd, lOf, lib 
22d 



and 



Administrator 
Survey 



Public 
Private 



18 
18 



School 
Survey 

Teacher 
Survey 



Public 16b(13) -(14) , 24a-24d, 31a-31e, 
32b-32d 

Private 19b (13) - (14) , 29e(l)-(2), 34a-34e, 
35b(l-27), 35c(28) and 35d(l-27) 

Public 14c and 28a(6) 
Private 14c and 28a(6) 



On the public use tape, items 9 and 10 from both Public and 
Private Teacher Demand and Shortage Survey; item 32 from the 
Public School Survey; item 3 5 on the Private School Survey; and 
item 28 on both Public and Private Teacher Survey have been 
deleted because of data reporting problems. 

A copy of an individual questionnaire can be obtained by looking 
at a NCES survey report or tape documentation from that 
questionnaire or requesting a copy from the NCES. 
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Variance Estimation 



Each SASS public use file includes a set of replicate weights 
designed to produce balanced half-sampled replicated variance 
estimates. The balanced half -sampled technique was used because 
software to produce such variance estimates is relatively common. 
The formula for the variance of a statistic Y is given below. 

Variance (Y) = 1/n S (Y -Y)^ 

r 

Where: Y^. is the estimate of Y using 
'' the r*^ set of replicate 
weights. 

n is the number of 
replicates. 

Below is a brief description of how the replicates were formrd. 
The first step in this process was to exclude all nonrespondj.ng 
units from the process. 

Replicates for Public Schools 

The public school file was placed into replicates by first 
forming 48 variance strata within each state. Each variance 
strata contained at least two schools which were alternately 
divided into two half-samples. To form the variance strata, 
certainty schools were placed in their own variance stratum 
where each certainty school was assigned to each half-sample. The 
noncertainty schools within a state/school level sampling stratum 
were sorted by the school's order of selection. Pairs of schools 
were then systematically placed into consecutive variance strata. 
When the sample size within a sampling stratum was larger than 96 
(48 X 2), contiguous variance strata were collapsed, so that the 
size of each variance stratum was as equal as possible, and until 
there were exactly 48 variance strata in the sampling stratum. 
Some variance strata, therefore, have more than two schools in 
them. The variance stratum numbering for the next sampling 
stratum started where the previous sampling stratum left off. 
When there was an odd number of noncertainty schools within a 
'-ampling stratum, one variance stratum was assigned an odd number 
of cases and adjustments were made to the replicate weights to 
account for this odd number of cases. After the variance stra .a 
were assigned, an orthogonal matrix was used to form the 48 
replicates. 
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Private School Replicates 



The sampling strata for private school were too numerous to use 
the method described for public schools. Instead of forming 
variance strata within sampling strata, it was done within 
collapsed strata. The collapsed strata were census region by 
school level by affiliation (13 groups). 

For list frame and certainty area frame ISHs, the following was 
done to form variance stratum half-samples: 

Within each collapsed stratum, noncertainty schools were 
sorted by sampling stratum by order of selection. Pairs of 
schools were then consecutively placed into 48 variance 
strata, each element of a pair being assigned to different 
half-samples. If a collapsed stratum had an odd number of 
noncertainty schools then one of the variance strata had an 
odd number of schools and an adjustment was made to account 
for this. Certainty schools were assigned to each half-sample. 

For noncertainty area frame PSUs, the following was done to form 
variance stratum half-samples: 

Within each PSU stratum, Westat had already paired the PSUs 
into half-samples. These pairings were consecutively assicned 
to variance strata, after sorting the pairs by the PSUs or sr 
of selection. 

After the variance strata were assigned, an orthogonal matrix was 
used to form the 48 balanced half-sampled replicates. 

Administrator Replicates 

Since nonresponding schools were excluded from the replication 
process, using the school replicates for administrators presents 
a problem. Responding administrators without responding schools 
would not have an assignment in the school replicates. To resolve 
this problem, administrators were placed into replicates 
independently, using the same school algorithm. Instead of 
excluding nonresp.onding schools, nonresponding administrators 
were excluded. 



Teacher Replicates 

Since some responding teachers' schools may not have responded to 
the SASS, the school replicates described above are not 
appropriate for teachers. Such teachers were not assigned to a 
variance stratum. For this reason, all responding SASS teachers 
were placed into replicates independently from the school 
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replicates. The first step in forming teacher replicates is to 
create a single school record when at least one teacher in a 
noncertainty school responded to SASS. These records were then 
placed into variance strata using the school variance strata 
algorithm described above. The responding teachers within a 
school were then assigned to the school's variance stratum 
half-sample. Teachers from certainty schools were placed into the 
same variance stratum, but were split into different 
half-samples. After the variance strata were assigned, an 
orthogonal matrix was used to form the 48 balanced half-sampled 
replicates . 

LEA Replicates 

To reflect the fact that LEAs were selected through the school, 
its important to form LEA replicates using the school replicate 
procedure. Since some LEAs may not have any responding schools, 
the school replicates are not appropriate for the LEA. Such LEAs 
were not assigned to a school variance stratum. The LEAs, 
therefore, were independently placed into replicates, using the 
school replicate procedures. First, a school record was formed 
for each responding school within a responding noncertainty LEA. 
Then, these records were placed into replicates using the school 
replicate algorithm. For each of these school replicates, an LEA 
was placed into the corresponding LEA replicate if any of the 
schools associated with the LEA were in that particular school 
replicate. Certainty LEAs were placed into all replicates. 

LEAs without schools were sorted by stratum and order of 
selection. Pairs of LEAs were then systematically placed into 
consecutive variance strata and each element of a variance strata 
were assigned to alternating half-samples. After the variance 
strata were assigned, an orthogonal matrix was used to form the 
48 replicates. 

Replicate We ights 

For school, administrator and teacher replicates, the nonzero 
replicate weights were either: 1) twice the final sampling weight 
for noncertainty sample units; or 2) the final SASS weight for 
certainty sample units. 

The noncertainty LEAs' replicate weights were assigned using the 
same LEA weight formula described in the weighting section, the 
only difference being that, for the basic weight calculation, 
each school stratum's selection probability for the LE/^ was 
divided by two. This is appropriate since the half-sample LEA 
selection probability within a school stratum is half of the 
respective full sample probability. Certainty LEA's replicate 
weights were the SASS final weight. 
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Cautions 



Replicated variance estimates assume sampling is done with 
replacement. For SASS, this was not the case. Unless the sampling 
rate is extremely high, the variance estiir ^tes should only be a 
slight overestimate. None of the public school sampling rates 
were high enough to provide a large variance overestimate. 
However, for some of the small private school affiliation strata, 
the sampling rates were high enough to possibly produce a large 
varia - overestimate. For this reason, consideration was given 
to adjusting the private school replicate weights to better 
estimate the vari'ance. The adjustment considered was the one 
appropriate for simple random sampling (i.e., 1-n/N where n and N 
are the sample and universe sizes, respectively) . 

An analysis was done to see if such an adjustment would improve 
the variance estimates. It was determined that it was unlikely 
that any private school estimate would be greatly reduced by an 
appropriate finite population correction. For more information on 
the analysis, see appendix 3. 
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Frame Evaluation 



When the first SASS estimates were produced inconsistencies were 
noticed between the SASS and the Common Core of Data (CCD) 
estimates. Some of these inconsistencies were caused from 
differences between the QED file (SASS frame) and the CCD file 
(the NCES's file of public schools and LEAs) . These differences 
are discussed below. 

School Estimates 

1) The definition of a school is different between the QED 
and CCD. QED defines a school as a physical location, 
while the CCD defines the school as an administrative 
unit. If two schools with separate principals reside in 
the same building, then QED (i.e., SASS) defines this as 
one school. CCD defines this as two schools. This 
definitional difference is most noticeable in Nebraska, 
where the CCD count was 44 percent larger than the SASS 
estimate. In North Dakota, the CCD count was 42 percent 
larger than the SASS estimate; in South Dakota, CCD was 
38 percent larger than the SASS estimate; and in Montana, 
CCD was 15 percent larger than the SASS estimate. These 
are the states with the largest differences. Other states 
may be affected by this definitional difference, but to a 
much smaller extent. On a national basis there are 6 
percent more CCD schools than SASS schools. 

2) for private schools, the SASS estimate of total number of 
schools is smaller than the number of schools represented 
on the frame. The differences are due to a combination of 
frame schools being out of scope, no longer in existence 
or duplicated on the frame, as discovered after the SASS 
collection. Table 14 provides a comparison. 
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Table 14. — Comparison between private school frame counts 

with SASS estimates of number of schools 



Source 



Number of schools 
on frame 



Estimated number 
of schools 
from SASS 



Total 



31,848 



26,807 



List frame 
excluding 
association 
add-ons 



22 , 600 



19,884 



Association 
add-ons 



1,586 



1, 018 



Area frame 
(weighted count) 



7, 662 



5,905 



LEA and School Estimates 

3) In Nebraska, QED excluded small (elementary grade only) 
LEAs. A match with CCD revealed 27 5 such LEAs were 
missing from the QED file. The missing LEAs represent 
2800 students with an average of 10.2 students per LEA. 
Since these LEAs are excluded from the SASS frame, they 
are not represented in SASS estimates. The schools 
associated with these LEAs are also not represented in 
SASS school estimates. These missing schools are 
contributing to the 44 percent QED underestimate for 
Nebraska stated above. 



LEA Estimates 

4) In Montana, QED collapsed LEAs with the same mailing 
address into one LEA. Usually, this was a single 
elementary school being collapsed with a high school. In 
Montana, CCD had 45 percent more LEAs than QED. All of 
this difference can be explained by the collapsing. It is 
unknown if other states are affected by this collapsing 
procedure . 
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5) In Louisiana, Arizona and Virginia, the CCD number of 

LEAs was approximately 20 percent larger than QED. This 
was caused by some of the sampled QED LEAs being out of 
scope (i.e., LEAs that don't hire teachers). 



Teacher Estimates 

6) In some states, the PTE or full-time equivalent (i.e., 

one full-time equivalent is equal to the amount of time a 
person would normally spend serving full-time on an 
assignment) teacher counts between the school and teacher 
file were not consistent. Part of the inconsistency can 
be attributed to two sources: 

a) The number of teacher head counts reported 
on the school file, on average, is higher 
than the number of teachers on the teacher 
frame given to Census by the school. In the 
average state, there were 5 percent fewer 
teachers on the teacher frame. This would 
cause head count or PTE estimates from the 
teacher file to be underestimates if the 
school did not include all teachers on the 
teacher frame. 

b) We believe that schools had problems 
providing PTE counts because it was not 
unusual for a school to report the same 
number of teacher head counts as PTE 
teachers when the school reported some 
part-time teachers. Given the way the 
questions were worded, it's possible this is 
alright, but given its frequency, it's 
unlikely. In an average state, 19 percent of 
the schools reported the same number of head 
counts as PTEs when some part-time teachers 
were reported. Since this seems to indicate 
that the reported PTE counts are higher than 
they really are SASS PTE estimates from the 
school file are likely to be overestimates. 
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Appendix 1 

Minimizing the School Overlap between the NAEP and NELS Samples 



This appendix describes how the original SASS selection 
probabilities, if no unduplication was required, were adjusted so 
that the number of overlap schools between SASS and NAEP or NELS 
samples is minimized without changing the overall probability of 
selection (i.e., the SASS school probability of selection 
averaged over all possible NAEP, NELS and SASS samples should 
equal the original selection probability) . To do this required 
knowledge about the NAEP and NELS selection probabilities. With 
this information for each QED school, the probability of not 
selecting a school in NAEP oi NELS can be determined. If the SASS 
original selection probability divided by the probability of not 
selecting the school in NAEP or NELS is less than one, and the 
school was selected in NAEP or NELS then the school was 
eliminated from _he frame. If the SASS original selection 
probability divided by the probability of not selecting the 
school in NAEP or NELS is greater than or equal to one, and the 
school was selected in NAEP or NELS then the school remained on 
the frame. However, the selection probability was reduced to 
minimize its chance of selection. Other QED schools not selected 
in NAEP or NELS had their selection probabilities adjusted to 
account for the unduplication. 

Since the overall probability of selection was the original 
school selection probability, the basic weights (weights without 
adjustment for nonresponse) are the reciprocal of the original 
school selection probability. There is no need to use the 
adjusted selection probabilities when weighting the school 
sample. 

The details of this process are described below. First, required 
terminology and sets of schools will be defined. Next, the 
definition of the adjusted probabilities of selection 
(conditional selection probabilities) will be defined. Selecting 
the SASS sample with these conditional selection probabilities 
will maintain the original selection probabilities over all 
possible NEAP, NELS and SASS school samples, while minimizing the 
overlap. 

Terminology 

A - NAEP sample 

E - NELS sample 

- SASS sample 
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i - school 

Fg - schools in the frame 

F^^ - schools in the A or E frames 

ae 

NS - not selected 
S - selected 

- original selection probability for school i without 
regard to the NAEP and NELS samples 

Pg. - is NAEP selection probability for school i 

p. (E j i NS in A) s NELS selection probability for school i 

• : iven that school i is not selected in the 
NAEP sample 

P-(NS) - probability of school i not being selected in the 
A or E samples 



Important Sets of Schools 



w = 


{i 


e 




i does not match any school in 


z = 


{i 


e 




i matches a school in F^^} 


X = 


{i 


e 




P^./PCNS) < 1} 


Y = 


{i 


e 


z| 


P3,./P(NS) > 1} 



Conditional Selection Probabilities 

In order to compute the SASS conditional selection probabilities 
Pj (NS) must be computed first. To do this, the NAEP and NELS 
selection probabilities (P^. and P,. (E \ i NS in A)) must be 
obtained from the NAEP and NELS contractors. Given these 
probabilities P,(NS) can be computed as follows: 

P. (NS)=P( i NS in A or E) 

=P. (NS in A)*P. ( NS in Ej i NS in A) 
= (i-P .)*(1-P,(E I i NS in A)) 
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This probability will be used to compute the SASS conditional 
selection probabilities described next. 



If i € W then 

P, (S in I A and E) = P^, 
If i € X then 

Pj (S in I A and E) = 



Pg,/P, (NS) if i NS in A or E 
0 otherwise 



If i € Y then 



P. (S in ] A and E) = 



1 if i NS in A or E 

(P^j-P,(NS))/(1-P,-(NS)) if i S in 

A or E 



The SASS sample was selected using these conditional selection 
probabilities. It is easy to verify that these probabilities will 
preserve the initial selection probabilities (P^j) while 
minimizing the NAEP and NELS sample overlap. 
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Appendix 2 



An Allocation of Schools and Teachers which Minimizes 
Loss of Relative Precision for Specific Estimators 



Background 

Estimates of school and teacher characteristics are obtained via 
a stratified design. Stratification variables consist of states 
X school level x sector. There are 51 states (including the 
D.C.), three levels (elementary, secondary and combined schools) 
and two sectors (public, private) indexed by i,j, and k 
respectively. 

Four classes of school characteristics and four classes of 
teacher characteristics are of interest. Specifically, these are 
the U.S. level, state comparisons, public v.s. private 
comparisons, and elementary v.s. secondary v.s. combined 
comparisons. Each of these estimates can be put into the same 
general form. De.note a specific school and teacher 
characteristic by "x" and "y", respectively. Denote strata means 
by x. y^ and y..^. Denote school and teacher stratum weights by 
and i/j.,^, respectively. An estimate of the following classes looks 
like: 

X = SSS Wjj,^ Xjj,^ for schools 
ijk 

and Y = SSS Uj^,^ Yi]\^ foi^ teachers 
ijk 

For example, to estimate the U.S. total take 
W. = # schools in strata i,j,k. 

IJK 

U... = # teachers in strata i,j,k. 

1 J K 

Similarly, a State i-verage would be based on 

Wjj,^ = # schools in strata iik 

# schools in State i. 

Uj.jj = # teachers in strata iik 

# teachers in State i. 

Denote S,. = # schools in strata ijk and 

1 J K 

T; = # teachers in strata ijk 

IJK 
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The specific weights used are in the following table of strata 
weights: 



Class C 
U.S. 1 

State:! 2 
Level :j 3 
Sector :k 4 



Schools 
Average Total 



Si 

S: 



Teachers 



Average 



T,jk/T. 



Total 

,ijk 
,ijk 
ijk 



When referring to estimates of a specific class, "c"; the 
corresponding weights will be denoted by W^.,. and U^.j. 

CIJK ClJIs 

Variances 



and 



V(X) = SSS W2.j^ V(X.jJ 
ijk 

V(y) = SSS U2..^ V(Y..,) 



V(x.j,^) is approximated by: 



(Assuming variances of measurements 
with the same level and sector are 
equal from state to state) 



(l-"ijk/Nijk) Vj,/n.., 
V(Y.jj^) is approximated by: 

(i-^iik/Niik) Qijk/"ijk + (1-^k/Mjk) Q2jk/("ijk ^k) 

Where n.^,^ and N.j,^ denote, respectively, the number of schools 
sampled and the total number of schools in strata ijk and m^^^ and 
Mj,^ denote, respectively, the sampled number of teachers/school 
and the average number of total teachers/school in strata j,k 
(assumed equal for states) . 

An estimate of Vj,^ was obtained as follows: 

To approximate the actual design, a variance term of the 
following form is nc:^eded: 



where : 



S2jk 



is the popu] -"tion variance. 



DEFF: 



jk - V^k/s jk 



(the variance under the desired 
design divided by the variance 
under simple random sampling) 
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since (omitting subscripts) 

CV = SE (EST) /EST (the standard error of an estir'.ate divided 

by the estimate) 



for proportions: SE(EST) = DEFF S/n, 
an estimate of S^ is: 

s^ = cv^ X n X estVdeff 
and so, V.,= n.,CV'., EST'., 

The calculations for totals are the same since estimates of 
totals consist of weighted means. From simulations, CV, DEFF and 
EST are obtained .for the school variables. 

The between and withxn teacher variances Q^^.,^ and Qjj^ were simply 
the variances obtained from Census which esiiimate the population 
variance of a pps sample with replacement. 



Separate Allocations For Each Estimate 

The eight variances to be minimized for a fixed cost are: 

v(x,) = ssi: v(x.j,) 

V(Y,) = SS2 U2.., V(y,jJ 
ijk 

denote: CS^ as the cost to sample a school in sector k, C^^biic 

includes the LEA cost and Cp^iyg^g is just the school 
cost, but with an allowance to cover response to 
LEA questions • 

CT as the marginal teacher cost (the cost to sample a 
single teacher once the school is in sample) . 

GC the total variable cost for the survey 



The problem is to choose n. schools and m^j^ teachers/school in 
stratum ijk to minimize V(X^) or V(Y^) subject to the cost 
constraint : 

2SS (CS^ n.j,^ + CTn.jj^ m.^^) = GC = Total Cost 
ijk 

Closed form solutions to obtain an allocation which minimizes 
only one variance at a time can be obtained by using Lagrange 
multipliers to incorporate the cost restriction (See Cochran, 
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Chapter 10 for example) 

a) separate s'^hool estimates: For an estimate of class "c", the 
optimal allocacion is: 

(W2,.jk (C^ + CT))l/2 

r\. = 

(C\ +CT) S (W2..^ Vj^ (C^ + CT))1/2 
ijk 

and, of course, = 1, since teachers characteristics are of no 
interest here. 

b) separate teacher estimates: 

For an estimate of class "c", the optimal allocation is: 

CS^ Q2j,^ , a trade-off of 

m^j,^ = between/withii costs and 

CT (Q^j.^ - Qjjk/Mjk) variances 



GC 

(C,+mj,CT) 

1/2 



"ijk = — ^ 



ijk 



1/2 



Compromise Allocation 

The idea is to pick a single allocation so that the resulting 
increase in variance for each estimate is minimized on average. 
This is the final allocation used. This idea is quantified as 
follows . 



Obtain the minimum variance under the optimal allocation specific 
to each estimate, as if that was the only estimate needed. The 
separate allocations needed to obtain these allocations have been 
outlined above in the section titled "Separate Allocations for 
Each Estimate'*. Call these variances VO(X^) . . . , VO(X^) , VO(Y^)... 
VO(YJ. 
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Choose a final allocation which minimizes the sum of the variance 
increases, i.e., minimize: 



V(X2) V(XJ V(Y,) 

+ +. . . + + 

VOCXj) VO(XJ VO(Y^) 



V(Xi; 
VO(X,) 

This can be rewritten as: 



2SS (SH2.., V(x..,) + TH^.j, V(y..,) 
ijk 



where 



and 



W 



SH 



2 _ 
ijk 



1i jk 



ci jk 



+ . . .+ --• 

VO(X,) V0(XJ 



V(YJ 
V0(YJ 



U' 



1i jk 



U' 



c i j k 



ijk 



VO(Y,) 



V0(YJ 



Which is a linear combination of the strata variances. 

A closed form solution isn't possible in this case. An iterative 
solution is employed which uses the number of teachers/school 
obtained for the separate allocations and updates it. 

The update is performed as follows: 

STEP 1) obtain initial estimate of m.^ and construct 
teacher fpc and relative state allocations. 

STEP 2) minimize variance function with respect to n.j^ and 
m.,^ given the teacher fpc and relative State 
allocation. 

STEP 3) Based on the new set of mj,^ which minimize the 

variance function, obtain a new teacher fpc and 
relative State allocations. 

STEP 4) go to step 2) and repeat until convergence. 
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Teacher Comparisons 



To account for an interest in particular fields of teachers, the 
following adjustment was made. For "f" fields of teaching, the 
increase in variance (if 1/f of the teachers are in each field) 
can be summarized as: 

Further, the components of the variance related to teachers in 
secondary or combined schools was multiplied by f to reflect the 
fact that field of study estimates for this group may be needed. 
In other words, when constructing TH, we actually used: 

fx 

'^^ VO(Yi) V0(Y2) V0(Y3) VO(YJ 

when j = (secondary or combined) . 



Variances 

Based on the optimal allocation, variances can be computed. 
Along with variances related to the estimates mentioned, LEA 
variances are calculated. 
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Appendix 3 



Effect of a Finite Population Correction on SASS Variance 
Estimates 

Replicated variance estimates assume sampling is done with 
replacement. For SASS, this was not the case. Unless the sampling 
rate is extremely high, the variance estimates should only be a 
slight overestimate. None of the public school sampling rates 
were high enough to provide a large variance overestimate. 
However, for some of the small private school affiliation strata, 
the sampling rates were high enough to possibly produce a large 
variance overestimate. For this reason, consideration was given 
to adjusting the private school replicate weights to better 
estimate the variance. The adjustment considered was the one 
appropriate for simple random sampling (i.e., 1-n/N, where n and 
N are the sample and universe sizes, respectively) , 

Some simple simulations were perforiried to measure the effect of 
such an adjustment when the sample was selected probability 
proportionate to size. First, a school frame was assumed. Then, 
all possible school samples were generated and an estimate of 
total schools was produced for each sample. In each sample, 66 
percent of the schools were selected,. From this, the true 
standard error was computed. 

Next, all i ossible replicate standard error estimates were 
computed for each sample. These standard error estimates were 
averaged to produce an average replicated standard error 
estimate. 

Finally, the average replicated estimate was adjusted 
appropriately to reflect a simple random sample finite population 
correction factor (adjusted replicated estimate) . With this 
information, it v/as possible to measure the error in both the 
unadjusted and adjusted replicated standard error estimates. 

The three tables below show the results: 
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Table 15« — Effect of finite population correction (fpc) 
where the distribution of probabilities is 
unequal and skewed 



School frame 



Elementary 
School Probability 

1 0.9 

2 0.9 

3 0.2 



Secondary 
School Probability 

4 0,9 

5 0.9 

6 0.2 



Two out of three schools were selected within 
each stratum using a probability proportionate 
to size sampling scheme. The probability column 
above provides the measures of size. 

From the set of all possible samples, the following 
numbers can be computed. 



(a) 
True 

std error 
2.199887 



(b) 

Average replicate Ratio of 

std error b to a 

2-459549 1. 118033 



Unadjusted, the replicated standard error 
overestimates the true standard error by 12 percent. 
If the replicated standard error is adjusted 
by the square root of (l-n/N) , where n and N are 
the sample and universe sizes, respectively then 
the following numbers are obtained- 



(c) 

fpc adjusted Ratio of 

replicate std error c to a 

1-420021 0.645497 



Now, the adjusted replicated standard error is 
underestimating the standard error by 35 percent- 



82 



60 



Table 16. 



— Effect of finite population correction (fpc) 
where the distribution of probabilities is 
unequal and not skewed 



School frame 



Elementary 
School Probability 

1 0.5 

2 0.7 

3 0.8 



secondary 
School Probability 

4 0.5 

5 0.7 

6 0.8 



Two out of three schools were selected within 
each stratum using a probability proportionate 
to size sampling scheme. The probability column 
above provides the measures of size. 

From the set of all possible samples, the following 
numbers can be computed. 



(a) (b) 

True Average replicate Ratio of 

std error std error b to a 

0.462yl0 0.707106 1.527525 



Unadjusted, the replicated standard error 
overestimates the true standard error by 53 percent. 
If the replicated standard error is adjusted 
by the square root of (1-n/N), where n and N are 
the sample and universe sizes, respectively then 
the following numbers can be computed. 



fpc adjusted Ratio of 

replicate std error c to a 

0.408248 0.881917 



Now, the adjusted replicated standard error is 
underestimating the true standard error by 12 percent. 
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Table 17. — Effect of finite population correction (fpc) 
where the distribution of probabilities is 
almost equal and not skewed 



School frame 

Elementary Secondary 
School Probability School Probability 

1 0.64 4 0.64 

2 0.67 5 0.67 

3 0.69 6 0*69 



Two out of three schools were selected within 
each stratum using a probability proportionate 
to size sampling scheme. The probability column 
above provides the measures of size. 

For the set of all possible samples, the following 
numbers can be computed. 

(a) (b) 

True Avi^rage replicate Ratio of 

std error std error b to a 

0. 066387 0. 113 32 8 1.70707 6 



Unadjusted, the replicated standard error 
overestimates the true standard error by 71 percent. 
If the replicated standard error is adjusted 
by the square root of (1-n/N), where n and N are 
the sample and universe sizes, respectively then 
the following numbers can be computed. 



(c) 

fpc adjusted Ratio of 

replicate std error c to a 

0.065430 0.985581 



Now, the adjusted replicated standard error is 
underestima\.ing the standard error by 1 percent. 
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The tables show that the adjusted standard errors underestimate 
the standard error, by as much as 35.5 percent (1-.645), if the 
selection probabilities are unequal and skewed. Since the 
underestimate can be so large, it's probably unwise applying an 
fpc adjustment. 

Another argument against applying an fpc adjustment is the effect 
the area frame sampling rate has on the overall sampling rate. 
The 551 area sample schools represents 24 percent of the total 
number of private schools. These schools come from a sample of 75 
out of 2,497 PSUs, a sampling rate of 3 percent. On average, one 
might expect that 24 percent of an affiliation's estimate would 
come from the area sample, with a 3 percent sampling rate. This 
should significantly reduce the overall sampling rate of 
affiliations with high list frame sampling rates. A similar 
si' . ation arises if schools from other list frame strata with low 
sampling rates, report belonging to an affiliation with a high 
list frame sampling rate. Table 18 below shows the impact of the 
area sample on the affiliation estimates with the highest list 
frame sampling rates. 

The table shows that except for Friends, military schools and 
Christian international schools, a significant part of the 
standard error comes from the area sample, which has a low 
overall sampling rate. Therefore, adjusting for a high list frame 
sampling rate should have minimal impact on the total variance. 
The high variance for Friends, military schools and Christian 
international schools is caused by schools from other list frame 
strata reporting they belong to Friends, military schools or 
Christian international schools. The variance contribution from 
the Friends and Military schools list frame stratum is zero since 
they were all selected with certainty. The variance is solely 
coming from schools from other list strata that make up part of 
the estimate. 

Because the overall affiliation sampling rate is greatly reduced 
by the low sampling rate of the area frame and some other list 
strata, and the fact that the proposed adjustment would produce a 
possibly large underestimate of the variance, the variance 
replicates were not adjusted for the high sampling rates. It is 
unlikely that any private school estimate would be greatly 
reduced by an appropriate finite population correction. 
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Table 18. — Estimate of number ol schools 
by list and area stratum 



List 


frame 


Area 


frame 


School Estimate 


Standard 


Estimate 


Standard 




error 




error 


Friends 73 


11 


2 


2 


Military schools 51 


19 


0 


0 


Exceptional children 199 


44 


109 


52 


Christian interna'tional 3 08 


49 


3 


3 


Episcopal 319 


31 


27 


5 


Montessori 4 67 


54 


216 


106 


Jewish 514 


23 


90 


52 
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